Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
نویسندگان
چکیده
BACKGROUND A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time. One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model. METHODS Data for 637 malignant breast cancer patients admitted in the south-east region of Sweden were analyzed. By using CCA and looking at the structure coefficients (loadings), relationships between tumor specifications and the two outcomes during different time intervals were analyzed and a correlation model was built. RESULTS The analysis successfully detected known predictors for breast cancer recurrence during the first two years and distant metastasis 2-4 years after diagnosis. Nottingham Histologic Grading (NHG) was the most important predictor, while age of the patient at the time of diagnosis was not an important predictor. CONCLUSION In cancer registers with high dimensionality, CCA can be used for identifying the importance of risk factors for breast cancer recurrence. This technique can result in a model ready for further processing by data mining methods through reducing the number of variables to important ones.
منابع مشابه
Simultaneous modeling of multiple recurrences in breast cancer patients
Introduction: Breast cancer is one of the most common recurrence cancers among women. There are several factors that can affect multiple recurrence of this disease. On the other hand, simultaneous examination of the types of relapses will make the results more accurate. The purpose of this study was to use a joint frailty model to model multiple recurrences of breast cancer patients. Materials ...
متن کاملمدل های شکنندگی توام چند متغیره برای مدلبندی رخدادهای بازگشتی چندگانه وکاربرد آن در سرطان پستان
Background: Breast cancer is one of the most common recurrence cancers among women. There are several factors that can affect the multiple recurrence of this disease, which have been studied and recognized in various studies, however, with the fact that these factors are known, can not always accurately predict the incidence of metastasis. On the other hand, simultaneous examination of the type...
متن کاملApplication of Canonical Correlation Analysis for Detecting Risk Factors Leading to Recurrence of Breast Cancer
BACKGROUND Advances in treatment options of breast cancer and development of cancer research centers have necessitated the collection of many variables about breast cancer patients. Detection of important variables as predictors and outcomes among them, without applying an appropriate statistical method is a very challenging task. Because of recurrent nature of breast cancer occurring in differ...
متن کاملDetermining the correlated factors of breast cancer recurrence by Poisson Beta-Weibull non- mixture cure model
Introduction: Therapies for many of diseases, especially cancer, have been improved significantly in the recent years, resulting in an increase in the number of patients who do not experience mortality. Therefore, the application of cure models is more suitable for survival analysis in this population than the usual survival models are. The aim of this study was to estimate the recurrence-free ...
متن کاملAssociation Between Oncotype DX Recurrence Score and Clinicopathological Variables in Breast Cancer Patients
Introduction: Breast cancer is the most common cancer and the leading cause of cancer-related death in women. Clinicopathological variables are important factors in deciding on breast cancer treatment. This study evaluated the association between the recurrence score generated by the Oncotype DX® 21-gene assay and classic clinicopathological variables. Methods: A single-institution retrospect...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- BMC Medical Informatics and Decision Making
دوره 5 شماره
صفحات -
تاریخ انتشار 2005